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Abstract 


The main protease of SARS-associated coronavirus 
(SARS-CoV), also called 3C-like protease (3CLP"), 
is vital for the viral replication. It cleaves the repli- 
case polyproteins at 11 sites and is a promising drug 
target. Several groups of inhibitors have been iden- 
tified through high-throughput screening and ra- 
tional drug design. In addition to the pharmaceuti- 


cal applications, a mutant 3CLP"” (T25G) with an 
expanded S1’ space has been demonstrated to toler- 
ate larger residues at Pl’, facilitating the cleavage 
behind the recognition sequence. This review sum- 
marizes current developments in anti-SARS agents 
targeting 3CL™ and the application of the mutant 
protease as a tag-cleavage endopeptidase. 
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1 Introduction 


Respiratory viral infective diseases are renowned as a major 
threat to all ages worldwide, particularly children under 5 years 
of age [1]. Beginning in late 2002, a severe epidemic disease 
called severe acute respiratory syndrome (SARS) emerged in 
China and quickly spread to more than 30 countries and areas 
[2], causing a significant impact on health and economy. 
SARS-associated coronavirus (SARS-CoV) typically causes res- 
piratory and enteric diseases, pneumonia, exacerbation of asth- 
ma, neurological symptoms, and myocarditis in humans [3-5], 
with a mortality rate as high as nearly 10% of those diagnosed 
[6]. SARS-CoV is a positive-stranded RNA virus belonging to a 
family of enveloped coronaviruses [5,7]. The SARS-CoV ge- 
nome contains five major open reading frames (ORFs) that 
encode the replicase polyprotein, the spike (S), envelope (E), 
membrane glycoproteins (M), and the nucleocapsid protein 
(N). Extensive proteolytic processing of these nonstructural 
and structural polyproteins is a vital step for providing the 
functional proteins for SARS-CoV viral propagation, and such 
processing is mediated primarily by the nonstructural protein 5 
(nsp5), the main protease (M?") with a dimeric chymotrypsin- 
fold, also known as 3C-like protease (3CLP™) because of its 
analogy with the monomeric 3C protease (3CP™) form picorna- 
virus [8-10]. 

In contrast to the common serine proteases containing a Ser- 
His-Asp catalytic triad, SARS-CoV 3CL’ employs a Cys-His 
catalytic dyad (Cys145 and His41) in the catalytic site. This is 
similar to the porcine transmissible gastroenteritis virus 
(TGEV) 3CLP and the human coronavirus 229E 3CL? 
(HCoV 229E) using Cys144 and His41 [8]. According to the 
previous studies on the substrate specificities and proteolytic 
sites, SARS-CoV 3CL™ utilizes the substrates with bulky hy- 
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drophobic residues (mainly Leu or Ile), invariable glutamine 
residue, and small aliphatic residues (Ser, Ala, or Gly) at the 
P2, Pl and P1’ positions, respectively [8-13]. In addition, sec- 
ondary structural studies for substrates of SARS-CoV 3CLP” 
have revealed that substrates with more beta-sheet like struc- 
tures tend to be cleaved more quickly [12]. The determination 
of the crystal structures for HCoV 229E 3CL?” and the inhibi- 
tor complex of TGEV 3CLP" also confirmed a remarkable de- 
gree of conservation of the substrate binding sites for CoV 
3CL?” [8]. 


2 Characterization of SARS-CoV 3CLP"° 
2.1 Activity Measurements of SARS-CoV 3CLP"° 


SARS-CoV 3CLP° cleaves ppla (486kDa) and pplab 
(790kDa) polyproteins at the predicted 11 conserved sites 
(Fig. la) with a conserved sequence of (Leu, Met, or Phe)-Gln- 
(Ser, Ala, or Gly), where the glutamine residue is invariable at 
the P1 position (Fig. 1b) [12]. The reverse-phase high-pressure 
liquid chromatography (RP-HPLC) experimental system was 
used to analyze cleaved products by a linear gradient of aceto- 
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Figure 1. Schematic diagram of the SARS-CoV polyproteins, with cleavage sites of the two proteases, papain-like protease (PLP"°) and 
3C-like protease (3CL"°). Numbers 1-16 represent the nonstructural proteins (nsp) 1-16. (a) The SARS-CoV genome encodes four struc- 
tural proteins: spike (S), envelope (E), matrix (M), and nucleocapsid (N). (b) The cleavage sites and conserved amino acid sequences of 


PLP"? and 3CLP" aligned. 


nitrile-containing trifluoroacetic acid (TFA) and to screen in- 
hibitors, but it is a time-consuming process. A fluorogenic sub- 
strate, Dabcyl-KTSAVLQSGFRKME-Edans, with the fluores- 
cence quenching pair (Edans-Dabcyl) has been developed to 
facilitate enzyme assay and inhibitor screening (Fig.2) [14]. 
With this or other quenching pairs, such as 2-amino benzoyl- 
(Abz)-Tyr(NO ) [15], an intense increase of fluorescence was 
detectable upon the peptide cleavage by the protease due to 
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the effect of fluorescence resonance energy transfer (FRET). 
Accompanying a fluorescence plate reader, such a fluorogenic 
substrate can be applied to high-throughput inhibitor 
screening. In addition, fluorogenic peptides containing a 
SARS-CoV 3CL? consensus cleavage sequence and Rhoda- 
mine 110 [(Ala-Arg-Leu-Gln-NH) -Rhodamine] can also serve 
as substrates for the enzyme activity assay with high sensitivity 
(at low pM range) [16]. 
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Figure 2. The fluorogenic substrate used for the SARS-CoV 3CLP" inhibition assay. Enhanced fluorescence caused by cleavage of the 
fluorogenic substrate peptide can be monitored at 538 nm with excitation at 355 nm. 
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2.2 Dimerization of SARS-CoV 3CLP"° 


The protease contains three domains (I, II, and III) and the ac- 
tive site is located between domain I and II. The crystal struc- 
tures of the 3CL?” from TGEV, HCoV 229E, and SARS-CoV 
(8, 10, 17,18] reveal a common feature for CoV 3CLP"° with 
two chymotrypsin-like 6-domains (domain I & II) and one 
a-helical dimerization domain (domain III) which is absent 
from the picornavirus 3C and chymotrypsin [19]. In addi- 
tion, deletion of the first four amino acids in the N terminus 
significantly increased the dimer dissociation constant (Kp) 
and accordingly decreased the protease activity [20-21]. From 
analytic ultracentrifugation (AUC) analysis, sequential dele- 
tions of the first 3, 4, and 7 residues at the N terminus caused a 
12-, 205-, and 1275-fold increase in dimer Kp, respectively 
[20], indicating the role of Arg4 in dimerization. Besides the N 
terminus, a cluster of conserved serine residues (Ser139, 
Ser144, and Ser147) located adjacent to the active site, but away 
from the dimer interface of the protease initiates long-range 
cooperative interactions to modulate dimerization [22]. The 
mutation of Ser147 to Ala, which does not make any contact 
with the opposite subunit, totally abolished dimerization and 
enzyme activity. However, the measured Kp values of the dimer 
varied [14, 20, 23, 24], but a His-tag of the recombinant protein 
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for NiNTA purification might interfere with the dimer forma- 
tion [12]. 


3 Structure of SARS-CoV 3CLP"° 


From the crystal structure, the N-terminal residues 1-7 (N-fin- 
ger) are buried in the dimer interface with numerous contacts 
with the domain II close to the active site of the other protomer 
(Fig. 3) [10, 18, 25,26]. The first crystal structure of SARS-CoV 
3CLP" is complexed with a substrate-like hexapeptidyl chloro- 
methylketone (Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK) inhibitor, 
revealing the structural features of this protease [10]. The pro- 
tease forms a dimer with two protomers oriented almost at 
right angles to each other. The Cys-His catalytic dyad is located 
in an active site cleft between domains I and II, which adopt 
antiparallel $-barrel structures. Domain III, for mediating the 
tight dimerization, contains five a-helices arranged into a large 
antiparallel globular cluster. Another structure of the protease 
with C-terminal six amino acids from the neighboring proto- 
mer bound in the active site (product-bound form) reveals the 
detailed interactions of the protease with the P1 to P6 amino 
acids (QFTVGS) [18]. In this structure, two protomers (A and 
B) form a dimer, and the N terminus of protomer A is located 





Figure 3. Crystal structures of the wild-type and C145A mutant 3CL?®. In (a) and (b), the overall three-dimensional structures of wild- 
type (PDB ID 1Z11) and C145A mutant (PDB ID 1Z1J) 3CLP’° are shown as ribbons (protomer A is green and protomer B is blue), and the 
product in the active site cleft between domain | and II is shown in yellow. (c) The dimer structure is composed of protomer A (shown 
with a solid tube in green) and protomer B (shown with charge potentials). The C terminus of protomer B’ (shown in cyan) in another 
asymmetric unit that is intercalated into the active site of protomer B. (d) An enlarged view of (c) near the active site, showing the 
C-terminal amino acids of protomer B’ as well as the N-terminal amino acids of protomer A in the neighborhood of the active site of pro- 
tomer B. The active site of protomer B is shown in light yellow. 
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close to the active site of protomer B. However, in the active 
site of protomer B, extra electron density originating from the 
six C-terminal amino acids of protomer B’ (the nearby proto- 
mer in the crystal packing) was found. The side chain of Gln 
(the last C-terminal residue of protomer B’) is ~10 A away 
from the side chain of Ser (the first N-terminal residue of pro- 
tomer A), suggesting a positional shift after autocatalytic cleav- 
age of the premature protease. This structure provides the 
necessary information for rational design of the active-site in- 
hibitors [18]. The detailed interactions in the active site are as 
follows. In the S1 site, the side chain Os1 and Ne2 of Gln306 
(P1) form hydrogen bonds with Ne2 of the His163 imidazole 
ring and the carboxylate of Glul66. The oxygen atom of the 
side chain of P1-Gln forms H-bonds with the backbone NH 
atoms of Gly143 and Cys145. Therefore, the Sy atom of Cys145 
is at a suitable position to attack the peptide bond. The S2 site 
is formed by the main-chain atoms of Vall86, Asp187, Arg188, 
and Gln189 as well as the side chain atoms of His41, Met49, 
and Met165 to hold a bulky side chain, such as Val, Leu, or Phe 
at the P2 position. The N atom in the main chain of P2-Phe in- 
teracts with the O atom of His164 and the side chain interacts 
with Met49, Met165, Asp187, and Arg188 through hydropho- 
bic interactions. The side chain of P3-Thr is oriented toward 
the bulk solvent, so a hydrophilic residue can also exist at this 
position. The main-chain O atom of P4-Val accepts a H-bond 
(3.1 A) from the Ne2 atom of Gln189, and the N atom of the 
Val donates a H-bond to the Osl atom of Gln189, while anoth- 
er main-chain NH donates a H-bond (3.3 A) to Gln189. The 
side chain of P4-Val interacts with Met165 and Gln189 via 
hydrophobic interactions. S5 subsite is composed of the main- 
chain atoms of Thr190, Alal91, and Gln192. P5-Gly is not in 
contact with the protease. The S6 site is almost positioned at 
the outer area of the protein. However, the O atom and Oy of 
P6-Ser still interact with the backbone N and O atoms of 
Gln192. 


4 Inhibitors of SARS-CoV 3CLP"° 


The 3CL?® is essential for the propagation of the virus and, 
thus, serves as a promising target for anti-SARS therapy. Many 
inhibitors with micromolar and submicromolar inhibition con- 
stants against the SARS-CoV 3CLP have been identified from 
the high-throughput random screening using protease activity 
assays and cell-based assays, or the rational inhibitor design 
based on known inhibitors, active-site directed approaches tar- 
geting the cysteine residue, and structure-based virtual screen- 
ing, as described below. 


4.1 From High-Throughput Screening 


Several inhibitors against SARS-CoV 3CL”® identified by dif- 
ferent laboratories using high-throughput screening have been 
described previously [27]. Wu et al. identified ~50 active com- 
pounds by cell-based assay that block SARS-CoV replication, 
including two existing drugs, reserpine and aescin [28]. In the 
subsequent target verification, a compound named TL-3 that 
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was developed as a transition-state analogue of the HIV pro- 
tease (inhibition constant K; = 1.5nM and 4nM, against HIV 
protease and feline immunodeficiency virus protease, respec- 
tively) [29] was able to inhibit the SARS-CoV 3CLP” (K; = 
0.6 uM; measured by the Edans-Dabcyl FRET assay) (1, Fig. 4). 
Lopinavir (one of the two components from the anti-AIDS 
drug Kaletra® from Abbott) was shown to weakly inhibit the 
protease (K; = 15 uM), consistent with observed better clinical 
outcome for treating SARS patients with this drug in a Hong 
Kong Hospital [30]. Blanchard et al. had screened 50 000 drug- 
like small molecules to find five novel SARS-CoV 3CLP™ inhib- 
itors, MAC-5576, MAC-8120, MAC-13985, MAC-22272, and 
MAC-30731 (2-6, Fig. 4), with ICs9 values of 0.5-7uM mea- 
sured using Abz-Tyr(NO;) FRET assay [15]. Kao et al. screened 
50 240 structurally diverse small molecules and found 2 com- 
pounds targeting the SARS-CoV 3CL? [31]. One of the 
compounds, MP576 (7, Fig.4), a 3-quinolinecarboxylic acid, 
displayed inhibitory activity with a K; value of 2.5uM in the 
HPLC-based cleavage assay and an ECs value (half maximal 
effective concentration) of 7uM in the Vero cell-based 
SARS-CoV plaque reduction assay without apparent toxicity 
(CCso (half maximal cytostatic concentration) >50 uM) [32]. 
The other compound, MP521 (8, Fig. 4), an imidazolidine, ex- 
hibited a K; value of 11 uM, an EC;, value of 14uM, and a 
CCs9 value >50uM. A library of 960 commercially available 
drugs and biologically active substances were screened by Hsu 
et al. using the cell-based assay to find three inhibitors, namely 
phenylmercuric acetate, thimerosal, and hexachlorophene 
(9-11, Fig. 4), which inhibited SARS protease activity according 
to the Edans-Dabcyl FRET assay [33]. The K; values of phenyl- 
mercuric acetate, thimerosal, and hexachlorophene were deter- 
mined to be 0.7, 2.4, and 13.7 uM, respectively. Phenylmercuric 
acetate and thimerosal containing mercury are used as phar- 
maceutical excipients and more widely used as antimicrobial 
preservatives in parenteral and topical pharmaceutical formula- 
tions [34]. As shown below, other metal-containing com- 
pounds (such as zinc-containing compounds) were also identi- 
fied to be the protease inhibitors. The hexachlorophene 
derivatives were further explored as SARS-CoV 3CL?® inhibi- 
tors with similar inhibitory activity using the Abz-Dnp FRET 
assay by Liu et al. [35]. Chen et al. screened from a natural 
product library consisting of 720 compounds to obtain two 
compounds, tannic acid (ICs, (half maximal inhibitory concen- 
tration) = 3uM) and 3-isotheaflavin-3-gallate (TF2B) (ICs, = 
7M), to be inhibitors of SARS-CoV 3CL? with the Edans- 
Dabcyl assay [36]. These two large compounds (structures not 
shown) belong to a group of natural polyphenols in tea [37]. 
Also from a natural product library, an esculetin-4-carboxylic 
acid ethyl ester (MC8; 12, Fig. 4) with an ICs, value of 461M 
against SARS-CoV 3CLP" was derived from the marine sponge 
Axinella corrugate [38]. The screening was performed by using 
a sensitive red-shifted internally quenched fluorogenic substrate 
for SARS-CoV 3CLP", also based on the FRET between the 
CAL Fluor Red 610 and Black Hole Quencher-1 as a donor-ac- 
ceptor pair. In the cell-based assays, MC8 is an effective inhibi- 
tor of SARS-CoV replication in Vero cells at non-cytotoxic con- 
centrations (ECs9 = 112 uM; CCs9 > 800 uM). 
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Figure 4. SARS-CoV 3CLP" inhibitors obtained from high-throughput screening. 


4.2 From Rational Drug Design 


4.2.1 Peptidomimetics 


4.2.1.1 AG7088 Analogues 


Dragovich et al. synthesized a series of compounds targeting 
rhinovirus 3C protease for the possible treatment of the human 
common cold [39]. The best inhibitor is AG7088, a ketomethyl 
isostere of a tripeptide-conjugated ester (a peptide-like rhino- 
virus 3C protease inhibitor) with an ECso value of 0.013 uM in 
inhibiting the rhinovirus protease [40,41]. Its P1 is a y-lactam 
moiety to mimic the P1 glutamine residue, and P1’ is an 
a,ß-unsaturated ester that acts as a Michael acceptor to inhibit 
the rhinovirus 3C?"® by forming a covalent bond with the active 
site Cys residue. To improve cell membrane permeability, the 
P2 phenylalanine residue was replaced with a methylene iso- 
stere bearing a 4-fluorophenyl substituent. Although it was a 
potent inhibitor of the rhinovirus 3CP™, it was not approved 
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for clinical use due to a lack of obvious advantage in treating 
patients with common cold. He et al. from Agouron Pharma- 
ceuticals, Inc., applied for a US patent entitled “Inhibitors of 
SARS 3C like protease” for the application of the AG7088-like 
compounds with Michael acceptor to inhibit the SARS-CoV 
3CLP”. Two formulae were proposed as potential SARS pro- 
tease inhibitors (13 and 14, Fig. 5) without giving their activity 
data [42]. Kania et al. from Pfizer, Inc., developed a series of 
compounds with the same structural features of y-lactam at P1 
and a isobutyl group at P2 to inhibit SARS-CoV 3CLP®, as de- 
scribed in their world patent entitled “Anticoronaviral com- 
pounds and compositions, their pharmaceutical uses and mate- 
rials for their synthesis” [43]. These compounds were shown to 
provide protection for the Vero cells from HCoV 229E and 
SARS-CoV infection in a viability assay. Although having the 
same structural features, some of these compounds were inac- 
tive (ECs > 100M) against the SARS-CoV and some showed 
inhibitory activity. The best compound of this series displayed 
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Figure 5. Peptidomimetics as SARS-CoV 3CLP" inhibitors. 


an EC;, value of 0.8 uM (15, Fig. 5) to inhibit SARS-CoV repli- 
cation by targeting the SARS-CoV 3CL? without toxicity 
(CCso > 320 uM). A general formula is given in Fig. 5 (16). Rao 
et al. applied for a patent covering a series of AG7088 ana- 
logues with the common structure of compound 17 (Fig. 5) as 
inhibitors of SARS 3CLP"° [44]. Using a continuous fluoromet- 
ric assay with the fluorogenic substrate MCA-AVLQSGFR- 
Lys(Dnp)Lys-NH3;, an effective compound (18, Fig.5) in this 
series inhibited not only the 3CL” of SARS-CoV (K; = 9uM, 
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Kinact (rate of enzyme inactivation) = 0.0031 s'), but also the 
protease of HCoV 229E, TGEV, feline infectious peritonitis 
virus, mouse hepatitis virus, and infectious bronchitis virus 
[45]. This compound exhibited an ECs, value of 4.3 uM against 
mouse hepatitis virus on murine delay brain tumour cells in a 
plaque-reduction assay. It also has low toxicity (500uM 
compound only displayed 28 % inhibition of cell growth). Yang 
et al. discovered a potent inhibitor from the AG7088 analogues 
with cyclic y-lactam at the P1 position, TG-0205221 (K;= 
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0.053 1M for compound 19, Fig. 5), based on the HPLC assay 
[45]. The inhibitor exhibited potent activities in suppressing 
replication of SARS-CoV (4.7 log viral titer reduction at 511M 
concentration, ECs 9 = 0.6uM and cytotoxicity CCs9 >20 uM) 
and HCoV 229E (5.2 log viral titer reduction at 1.25uM con- 
centration, ECs, = 0.14uM). The tight binding between TG- 
0205221 and the SARS-CoV 3CL?” is through a covalent bond, 
hydrogen bonds, and unprecedented hydrophobic interactions 
as evidenced by the crystal structure of the corresponding com- 
plex [46]. Shie et al. found that the replacement of the y-lactam 
moiety in AG7088 with a phenylalanine side chain resulted in 
a more potent SARS-CoV 3CL? inhibitor [47]. As shown in 
Fig.5, the best inhibitor (20) of this series has a K; value of 
0.5uM measured by the Edans-Dabcyl FRET assay. The com- 
pound also showed potent activity (ECs59= 0.2uM) against 
SARS-CoV replication by blocking the synthesis of the viral 
spike protein without apparent toxicity (CCsọ>200uM) in 
cell-based assays [47]. From the computer modelling, the phe- 
nylalanine side chain at P1 and P2 positions may be shifted by 
one position to occupy S2 and S3 sites as the a,ß-unsaturated 
Michael acceptor is not covalently bound with the Cys145. Ac- 
cording to the product-bound crystal structure of the SARS- 
CoV 3CL™ [18], S2 is a hydrophobic pocket to hold the hy- 
drophobic residue, and the P3 side chain is exposed to the sol- 
vent. 


4.2.1.2 Anilides 


A series of peptide anilides with aniline derivatives, such as 2- 
chloro-4-nitroaniline, next to L-phenylalanine were shown to 
inhibit the SARS-CoV 3CL?® and also the viral replication 
[48]. Compound 21 shown in Fig.5 is the most potent inhibi- 
tor of SARS-CoV 3CL?" with a K; value of 0.03 uM using the 
Edans-Dabcyl FRET assay. Deletion of the chloro, nitro, or di- 
methylamino substituents from this compound significantly 
weakened the binding affinity. Also, replacing the dimethylami- 
no group with a nitro group caused a reduction in inhibitor 
potency. According to the molecular docking, the nitro group 
of the compound is predicted to be hydrogen bonded with the 
NH of Ala46, while the chlorine atom is within 3 A from the 
y-S atom of Cys145 and the Ne2 atom of His41, therefore pro- 
viding a possible key interaction with the catalytic dyad. 


4.2.1.3 Keto-Glutamine Analogues 


As the SARS-CoV 3CLP" recognizes a glutamine residue at the 
P1 site, Jain et al. synthesized and evaluated a series of keto- 
glutamine analogues with a phthalhydrazido group at the a 
position as reversible protease inhibitors [49]. Attachment of a 
tripeptide (Ac-Val-Thr-Leu) to these glutamine-based “war- 
heads” resulted in significantly better inhibitors. N,N-dimethyl- 
glutamine analogues are much less potent inhibitors (10- to 
100-fold larger ICs) than cyclic glutamine analogues. The best 
inhibitor (22, Fig. 5) showed an ICs 9 value of 0.6 uM measured 
using an Abz-Tyr(NO,) FRET assay. In the modelling struc- 
tures, the inhibitor is bound in an extended conformation, 
forming a partial -sheet and a hydrogen bond between His163 
and the P1 side chain. The modelling studies indicate that the 
active site of the enzyme has enough room to accommodate 
the bulky phthalhydrazido group. Some rearrangements of the 
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protein, in particular residue Glul66, are required to accom- 
modate the extra bulky group on the P1 residues. 


4.2.1.4 Glutaminyl Fluoromethyl Ketones 


Zhang et al. have designed and synthesized a series of dipeptid- 
yl glutaminyl fluoromethyl ketones (23, Fig.5) to covalently 
inhibit the SARS-CoV 3CL?" by reacting with the active site 
Cys residue [50]. These inhibitors were synthesized by using 
N,N-dimethyl-Gln-fmk as the P1 residue and warhead, in com- 
bination with different P2 amino acids via five-step reactions. 
The dipeptides with hydrophobic residues such as Leu, Val, 
and Phe at P2 position showed a better effect against SARS- 
CoV infected Vero cells, consistent with the protease substrate 
preference. The best inhibitor, as shown in Fig. 5, has an ECs 
value of 2.5uM tested with the Vero cells. The toxicity CCso 
value of the inhibitor is > 100M. However, the compounds 
with Gly and Ala as the P2 amino acid did not inhibit the viral 
replication. 


4.2.1.5 Aziridinyl Peptide 


Similar to fluoromethyl ketones, aziridine and epoxide can also 
react with nucleophilic amino acids within the active site of 
proteases. The screening using HPLC-based and FRET-based 
assays have identified a trans-configuration N-substituted 
aziridine-2,3-dicarboxylate, named (S,S) (C,H;O) Azi-Gly- 
Gly-OBn (24, Fig.5), as a modest inhibitor of SARS-CoV 
3CLP (54% enzyme activity was inhibited by 100uM com- 
pound) [51]. This activity may be enhanced by linking the elec- 
trophilic building blocks to appropriate amino acids (e.g., Gln), 
substrate analogous peptides, or peptidomimetics. 


4.2.1.6 Aza-peptide Epoxide 


An aza-peptide epoxide (25, Fig.5) was also found to inhibit 
SARS-CoV 3CLP® covalently with a K; value of 18uM and 
kinat Of 35x10” s™ [52]. According to the complex crystal 
structure, the epoxide ring of the inhibitor opens upon Cys 
attacking, leaving a hydroxyl group on the C2 atom to form 
hydrogen bonds with the Asn142 O61 atom of the protease 
(2.9 A) and the P2-Phe carbonyl O atom of the inhibitor 
(3.2 A). The configurations of the C2 and C3 atoms are in- 
verted from S,S to R,R. 


4.2.1.7 Miscellaneous 


Shella et al. applied for a patent entitled “Inhibitors of severe 
acute respiratory syndrome (SARS) SARS-CoV 3CLP"” [53]. 
They discovered pharmaceutical peptide-like compounds, in- 
cluding three components (26A, 26B, and 26C, Fig. 5), which 
inhibited SARS and rhinovirus 3C” in a mammal by adminis- 
tering effective amounts of the rhinovirus 3C”® inhibitors. 
However, no assay data are available. Cai et al. found that the 
compounds with the general structure of compound 27 (Fig. 5) 
can inhibit SARS-CoV 3CLP" in SARS-CoV and feline infec- 
tious peritonitis virus, as described in their patent entitled 
“Protease inhibitors for coronaviruses and SARS-CoV and the 
use thereof” [54]. The inventors performed cell-based assays 
using infected cells, such as the murine delay brain tumor cells 
infected by mouse hepatitis virus strain A59 or the Vero cells 
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infected by SARS-CoV, and found that these compounds could 
inhibit the viruses and protect the cells from infection. They 
further suggest the compounds may be administered to mam- 
mals, such as humans, orally at a dose of 0.0025-50 mg kg", or 
an equivalent amount of the pharmaceutically acceptable salt 
thereof, per day of the body weight when the mammals are 
being treated for SARS-CoV or other CoV-mediated disorders. 
Preferably, ~0.001-10 mg kg" is orally administered to treat or 
prevent such disorders. For intramuscular injection, the dose is 
generally ~50 % of the oral dose. 


4.2.2 Non-Peptidomimetic Inhibitors 


4.2.2.1 Isatin 


Certain isatin (2,3-dioxindole) derivatives are known potent in- 
hibitors against the rhinovirus 3C? [55]. This isatin core 
structure offers several advantages that include ease of synthe- 
sis and chemical modification. Its derivatives were tested as in- 
hibitors for SARS 3CLP™. A series of synthetic isatin derivatives 
showed IC;, values in the range of 0.95-17.54M from an 
Edans-Dabcyl FRET assay and confirmed by a HPLC-based as- 
say, and the best inhibitor found is listed in Fig. 6 (28), which 
has an iodine or brom atom in the isatin scaffold [56]. It can be 
concluded that the benzothiophenemethyl side chain provides 
more inhibitory effect than the benzyl, heterocyclic substituted 
methyl, and other alkyl groups. Recently, Zhou et al. have 
reported a series of N-substituted isatin derivatives against 
SARS-CoV 3CL?" [57]. The inhibition activity was measured 
by the continuous colorimetric assay using the colorimetric 
substrate Thr-Ser-Ala-Val-Leu-Gln-pNA. The C-5 position of 
isatin was found to favor a carboxamide group and the N-1 
position to favor large hydrophobic substituents for inhibiting 
the protease. The lowest ICs 9 value (0.371uM) was observed 
among these inhibitors. 


4.2.2.2 Metal-Conjugated Inhibitors 


As shown above in high-throughput screening of 960 com- 
pounds, 2 compounds (phenylmercuric acetate and thimerosal) 
identified as potent SARS-CoV 3CL?” inhibitors contain mer- 
cury (9 and 10, Fig. 4). This suggests the use of a metal ion as a 
chelator for Cys proteases as metal ions such as Hg”, Zn”, 
and Cu” have high affinities to the sulfur atom of the Cys resi- 
due [58,59]. As mercury-containing compounds may pose 
therapeutic hazards to a patient if orally taken, a series of zinc- 
containing compounds and metal ions were evaluated for 
SARS-CoV 3CLP" inhibition [33]. The most potent inhibitor 
found was 1-hydroxypyridine-2-thione zinc, a competitive in- 
hibitor with a K; value of 0.17 uM (29, Fig. 6) with more inhibi- 
tory activity than just Zn?" (K; = 1.1 uM). Zn”* was previously 
found to be tetrahedrally coordinated by three Cys sulfurs and 
one His nitrogen of the 2A protease from a common cold virus, 
which is responsible for the shut-off of host-cell protein syn- 
thesis [60]. Zinc-containing compounds, such as zinc acetate, 
are added as a supplement to the drug for the treatment of Wil- 
son’s disease [61], indicating the safety of the ion for human 
use. Zinc salts such as zincum gluconicum (Zenullose) may be 
effective in treating the common cold, a disease caused by rhi- 


www.ChemBioEngRev.de 


© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim 


noviruses, without knowledge of the molecular target [62]. 
Moreover, zinc ions inhibit the replication of rhinoviruses 
[63, 64]. Thus, the potential use of the zinc-conjugated com- 
pounds as a therapeutic treatment of the SARS disease needs to 
be explored. 


4.2.2.3 Aryl Boronic Compounds 


Bacha et al. proposed an attractive subsite for the design of po- 
tent inhibitors (30, Fig.6), which is a cluster of Ser residues 
(Ser139, Ser144, and Serl47) close to the catalytic residues 
[65]. In fact, this Ser cluster is conserved in all known CoV 
proteases and may represent a common target of wide-spec- 
trum CoV protease inhibitors. Based on the known reactivity 
of boronic acid compounds with the hydroxyl group in Ser res- 
idues, the inhibitory potency of bifunctional boronic acid com- 
pounds was evaluated. A chemical scaffold containing two phe- 
nyl boronic groups attached to a central aromatic ring by ester 
linkages were tested. This compound has a K; value of 0.041M 
based on an Edans-Dabcyl FRET assay. Different variations of 
the compound were prepared, including several isomers with 
replacement of the central aromatic ring with shorter ester 
linkage and different functionalities at the phenyl boronic 
rings. The highest improvement in affinity was observed when 
the ester linkage between the aromatic rings was replaced with 
an amide group, thereby resulting in nanomolar inhibition 
constants. The inhibitors display a mixed competitive pattern 
and bind to both free enzyme and enzyme substrate complex. 
This may be due to the large substrate used in the kinetic 
measurements. These compounds inhibited the enzyme in a 
reversible manner. Freire et al. provided a series of organic bo- 
ron-containing compounds (compound 31 as a representative 
and 31A, 31B, and 31C as examples in Fig. 6), compositions 
thereof and methods of using such compounds and composi- 
tions for inhibiting CoV proteases and for treating infections 
[66]. The invention provides a method of inhibiting CoV pro- 
teases, treating infections caused by CoV and detecting CoV, 
particularly CoV proteases that have one or more serine or 
threonine residue(s) at or near its active site and proteases of 
SARS-associated CoV. Such methods comprise contacting the 
testing sample obtained from a patient with boron-containing 
compound 31 [66] that have been: i) tethered to an appropriate 
surface such that protease coming into contact and being 
bound to the tethered compound can be detected; ii) labelled 
by fluorescent, radioactive, or other markers that allow identifi- 
cation of CoV protease bound to the compound; or iii) that by 
any other means can be used to detect the presence of CoV 
protease. 


4.2.2.4 Etacrynic Acid 


As shown above, fluoromethyl ketones and aziridinyl peptides 
are active-site-directed inhibitors. These scrutinized com- 
pounds contain electrophilic building blocks (aziridine, epox- 
ide, Michael system) that are known to react with nucleophilic 
amino acids within the active site of proteases. Derivatives of 
etacrynic acid, a well-known diuretic drug containing an acti- 
vated double bond, were also shown as non-peptidic SARS- 
CoV 3CL? inhibitors. A comprehensive screening with an 
HPLC-based assay was performed for SARS-CoV 3CL?®. The 
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Figure 6. Non-peptidomimetic inhibitors of SARS-CoV 3CLP"®, 


most promising inhibitor, etacrynic acid tert-butylamide 
(32, Fig.6) inhibits the protease (K; = 35.3uM, measured 
through a FRET assay). 


4.2.2.5 Benzotriazoles 


A class of stable benzotriazole esters were reported as mecha- 
nism-based inactivators of the SARS-CoV 3CL? [67]. During 
the preparation of a library of lopinavir-like compounds, var- 
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ious carboxylic acids were coupled with the amines in the pres- 
ence of 2-(1H-benzotriazole-1-yl)-1,1,3,3-tetramethyluronium 
hexafluorophosphate (HBTU) and the intermediates of the 
acids with HBTU were found to inhibit the protease. The com- 
pounds were purified and characterized and the most potent 
inhibitor (33, Figure 6) showed a K; value of 7.5nM. It is a 
covalent inhibitor and specifically acylates the Cys145 residue 
with a kinact of 0.0011 s™. 
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4.2.2.6 Glycopeptides 


Balzarini et al. patented novel glycopeptide antibiotic deriva- 
tives, such as compound 34 (Fig. 6), for the processes of their 
preparation, their use as a medicine, their use in treating or 
preventing viral infections and their use in manufacturing a 
medicine to treat or prevent viral infections [68]. The invention 
covers the use of glycopeptide antibiotics including vancomy- 
cin, teicoplanin, ristomycin, eremimycin-type aglycons, and 
their semisynthetic derivatives to treat or prevent viral infec- 
tions, especially the infections by the viruses belonging to Ret- 
roviridae, Herpesviridae, Flaviviridae, and Coronaviridae, such 
as HIV, hepatitis C virus, bovine viral diarrhoea virus, SARS- 
CoV, feline CoV, herpes simplex virus, varicella zoster virus, 
and cytomegalovirus. 


4.3 By Computer Modelling 
4.3.1 Phenylsulfonyl Pyridine 


Lu et al. performed a structure-based virtual screening of a 
chemical database containing 58855 compounds followed by 
testing the potential inhibitors against SARS-CoV 3CLP"° to 
identify 2 hits [69]. One of the hits was a 6-methoxy-3-nitro- 
2-(phenylsulfonyl) pyridine with an ICso value of 0.3 uM 
(35, Fig.7) measured using Edans-Dabcyl FRET assay. The 
core structure (diphenyl sulfone) of this compound was used 
to search for other analogues and the analogues were mea- 
sured for their inhibition constants against SARS protease. 
Wu et al. have filed a patent that relates to a method for 
modulating activity of SARS-CoV 3CLP” or an analogue 
thereof by contacting the protein with an effective amount of 
the series of diphenyl sulfone analogues [70]. This invention 
is based on the discovery that a group of dicyclic or multi-cy- 
clic compounds effectively inhibit the activity of SARS-CoV 
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3CLP" and hepatitis C virus NS3 protease, a structural ana- 
logue of SARS-CoV 3CLP"”, 


4.3.2 Oxadiazol-imidazole 


Another compound identified in virtual screening by Lu et al. 
was 2-({[3-(4-chlorophenyl)-1,2,4-oxadiazol-5-yl]methyl}thio)- 
4,5-dihydro-1H-imidazol-3-ium chloride with an IC;, value of 
3uM against SARS-CoV 3CLP" (36, Fig.7) [69]. The core 
structure as shown in bold was used to search for the structural 
analogues; nine compounds with moderate structural similarity 
were found to inhibit the SARS-CoV 3CLP" [69]. 


4.3.3 N-phenyl-2-(pyrimidinylthio)acetamide 


Using virtual screening and 3D-QSAR studies, Tsai et al. dis- 
covered a group of SARS-CoV 3CL?® inhibitors called Nphen- 
yl-2-(pyrimidinylthio) acetamide [71] and filed a patent appli- 
cation [72]. The most potent one (37) of these analogues, with 
an ICs value of 3 uM from an Edans-Dabcyl FRET assay, is 
shown in Fig. 7. The core structure is highlighted in bold. 


4.3.4 Cinanserin 


Chen et al. performed virtual screening on the existing medical 
database CMC (Comprehensive Medicinal Chemistry, MDL 
Information System, Inc.) for SARS-CoV 3CLP" inhibitors by 
using its three-dimensional structure. Cinanserin (38, Fig. 7) 
was identified to inhibit the protease [73] and two patents were 
filed [74,75]. The ICso values of cinanserin for inhibiting 
SARS-CoV 3CL” and HCoV 3CL protease were found to be 
4.9 and 4.7 uM, respectively, using an Edans-Dabcyl FRET as- 
say. Cinanserin was also found to block SARS-CoV replication 
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Figure 7. Inhibitors of SARS-CoV 3CLP"° derived from computer modeling. 
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in tissue culture. Treatment of the cells with 50 ug- 
ml! (134 uM) cinanserin reduced the SARS-CoV 
RNA concentration by >3 log units with an EC; 
value of 31 uM. The reduction of the titre of infec- 
tious particles exactly corresponded to the reduc- 
tion in virus RNA concentration. No toxicity of the 
compound was detected in the concentration range 
tested (< 130 uM). 


4.3.5 Calamidazolium Compound 


Lai et al. used computer modelling to simulate a 
three-dimensional structure of SARS-CoV 3CLP"° 
and obtained calamidazolium (39, Fig. 7) as a po- 
tential inhibitor [76]. HPLC assays were further 
used to verify that the compound indeed inhibits 
the enzyme. 


4.3.6 5-Benzylidene-4-0xo-1,3-thiazolidin b) 
Analogues 


Shen et al. used the structure (PDB ID 1LVO) of 
TGEV MP" as the template to simulate the struc- 

tural model of SARS-CoV 3CL?” and identified the 
active site. Through molecular docking, they 
screened compounds from a commercial data bank 

for active-site inhibitors. A series of 5-benzylidene- 
4-oxo-1,3-thiazolidin analogues with the core 
structure as compound 40 (Fig.7) were found to 5 
inhibit the recombinant SARS 3CLP" and prevent 

the virus infection on Vero-E6 cells [77]. 
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5 Engineering 
of SARS-CoV 3CLP"° 


In addition to serving as a promising target in anti- 
SARS chemotherapy, SARS-CoV 3CLP™ possesses 
potential as a tag-cleavage endopeptidase. As anal- 
ogous to picornaviruses 3C proteases, 3CLP™ has 
substrate specificity in cleaving the amide bond be- 
tween P1-Gln and a small amino acid such as Ser, 
Ala, or Gly at P1' [78,79]. According to the struc- 
ture of SARS-CoV 3CLP" [10,18], this small P1’ 
residue is near Thr25, which likely limits the sub- 
strate specificity. Since the recombinant SARS-CoV 
3CLP® can undergo auto-processing [18,80], a 
modification on the S1’ might tolerate a large amino acid at 
Pl’ position. In fact, SARS-CoV 3CLP” with T25G mutation 
expands the SI’ site and, thus, is able to cleave peptides with 
larger amino acids such as Met (the starting amino acid of most 
native proteins) at Pl’ [81] (Fig. 8a). Indeed, compared with 
the keat of 1.6 + 0.2 min” and the Km of 76.6 + 3.5uM for the 
wild-type, the T25G mutant displays the keat of 16.2 + 0.5 min™ 
and the Km of 18.6 + 2.4uM (43.5-fold higher kcat/Km) against 
the SAVLQJMGERK substrate. Moreover, vectors that contain 


www.ChemBioEngRev.de 


© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim 


m me 
SARS-CoV 3CLPr° 
recognition sequence 


Figure 8. Engineering of SARS-CoV 3CLP as an endopeptidase. (a) Structural 
basis for Thr25 mutation. Predicted structural model of SARS-CoV 3CLP” with a 
modified peptide containing Met at P1’ (Thr-Ser-Ala-Val-Leu-Gln-Met-Gly-Phe- 
Arg-Lys, as indicated), based on the crystal structure of the SARS-CoV 3CLP" 
H41A mutant in complex with a peptide (PDB ID 2Q6G). (b) Schematic illustra- 
tion of the expression of a target protein with a tag and the engineered cleav- 
age site. The fusion protein can be cleaved by T25G mutant SARS-CoV 3CL?” be- 
tween AVLQ and Met at P1’ (upper panel); strategy of using Pstl as a 5’ cloning 
site (CTGCAG) that is part of the sequence GCGGTGCTGCAG encoding the pro- 
tease recognition sequence AVLQ (lower panel). 


the nucleotides encoding the T25G recognition site AVLQJM 
between the tags and the N-terminal Met of the target proteins 
have been constructed and expressed by the prokaryotic and 
eukaryotic hosts. In these vectors, PstI (CTGCAG) is chosen as 
a 5’ cloning site, since its sequence overlaps with the nucleotide 
sequence (GCGGTGCTGCAG) encoding the protease recogni- 
tion site (Fig.8B). Identical 5'-PstI/3'-XhoI cloning sites in 
these vectors can be used to allow sticky-end DNA fragments 
of the target genes generated by PCR [82] to ligate with these 


ChemBioEng Rev 2015, 2, No. 2, 118-132 128 


Wiley Online Library 


vectors simultaneously (parallel cloning method) [83,84]. 
These engineered vectors and the T25G mutant can be as- 
sembled as a kit for the maximal production of soluble and 
functional proteins with authentic sequences (see the patent 
entitled “Protein expression system involving mutated severe 
respiratory syndrome-associated coronavirus 3C-like protease“ 
by Liang et al. [85]). 


6 Conclusions 


Since viral protease inhibitors are remarkably effective at block- 
ing the viral replication and virus-induced cytopathic effects, in- 
hibiting 3CL?® proteolysis is a convincing strategy against 
SARS-CoV. In this review article, the effective inhibitors against 
SARS-CoV 3CLP"” are summarized and the methodology for 
detecting the protease activity and inhibitor screening are dis- 
cussed. Most of the peptidomimetic inhibitors of the SARS-CoV 
3CL?” are designed based on its substrate specificity of P1-Gln, 
but some have Phe to replace the Gln moiety. For covalent in- 
hibitors, the electrophilic structural features, such as Michael ac- 
ceptor (@,ß-unsaturated ester), fluoromethyl ketone, aziridine, 
epoxide, and etacrynic acid, are built in to react with the active- 
site Cys residue. The metal-containing compounds also target 
the Cys145 residue. Other compounds that have been suggested 
to bind to the active site of SARS protease based on computer 
modelling are potential inhibitors. All the inhibitors for 
SARS-CoV 3CL?° may also be used in inhibiting other viral 
proteases due to some degree of structural similarity of their ac- 
tive sites. In fact, some of the SARS-CoV 3CLP"® inhibitors are 
derived from previously identified rhinovirus 3C?" inhibitors 
such as AG7088 (now called Rupintrivir). Rupintrivir is a potent 
inhibitor of 3CP™ based on in vitro and in vivo studies. Its ana- 
logues modified based on the structural differences between 
3CP and 3CLP™ represent promising leads for anti-CoV agents. 

Several classes of 3CLP"® inhibitors can also inhibit picorna- 
viral 3CP'° [86]. Structural studies have revealed that 3CP™° is a 
monomer, whereas 3CLP"° is a dimeric protein which has an 
extra third domain for dimer formation, although their active 
sites are superimposed very well [87]. Moreover, 3CP™° has a 
more open but shallow S2 site, as well as a smaller S3 site, 
which account for different inhibitory specificity, as concluded 
from the structural comparison of the same inhibitor in the 
3CP and 3CLP" active sites [86]. Moreover, these inhibitors 
could possibly serve as new drug leads for other viral diseases 
such as those caused by rhinovirus, HCoV 229E, enterovirus, 
transmissible gastroenteritis virus, feline infectious peritonitis 
virus, mouse hepatitis virus, and infectious bronchitis virus. 
Particularly, the recently emerged SARS-like coronavirus, 
human betacoronavirus 2c EMC/2012 (HCoV-EMC), also 
called Middle East respiratory syndrome coronavirus (MERS- 
CoV), has a 3CL? structurally similar to SARS-CoV 3CL?° 
and the protease can be inhibited by N3 (18, Fig. 5) [88], which 
is a Rupintrivir analogue inhibiting SARS-CoV 3CLP". More- 
over, CE-5, a small-molecule inhibitor of 3CLP"° of SARS-CoV 
[89] also inhibits the activity of MERS-CoV 3CL?” [90]. There- 
fore, the information discussed in the present review helps to 
ultimately develop drugs against infectious CoVs and picorna- 
viruses, which should be continuously pursued. 
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A mutant (T25G) of SARS 3CLP™ can recognize Ala-Val- 
Leu-Gln\Met and cleave the fusion proteins efficiently. As a re- 
sult, it can serve as a novel endopeptidase for tag cleavage when 
the fusion proteins expressed in E. coli and yeast Pichia contain 
the AVLQ sequence immediately before their own N-terminal 
Met residue. Moreover, the mutant protease also showed simi- 
lar cleavage efficiency to the commonly used FXa in cleaving 
the tag under the same condition. Other vectors with different 
expression systems including baculovirus-infected insect cells, 
mammalian cells, and cell-free system, could also be con- 
structed to contain the same protease recognition site. The 
common 5’-PstI/3'-Xhol restriction sites can be used for paral- 
lel cloning of the sticky-end PCR product of a target gene into 
these vectors. These engineered vectors and the mutant pro- 
tease can be assembled as a kit for parallel expression of a tar- 
get gene in multiple hosts simultaneously to screen for the 
most optimal condition to produce a soluble and functional 
protein with maximal yield. The fusion protein can then be 
purified with the affinity column against the tag (e.g., NiNTA 
for His-tag) and the tag can be removed by T25G 3CL”” to 
yield the recombinant protein with authentic sequence. 
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Symbols used 


Kp [M] dimer dissociation constant 

K; [M] inhibition constant 

Kinact [s] rate of enzyme inactivation 

Abbreviations 

3CLP° 3C-like protease 

3cr° 3C protease 

AUC analytic ultracentrifugation 

CCs9 half maximal cytostatic concentration 

CMC Comprehensive Medicinal Chemistry 

CoV coronavirus 

E envelope 

ECs half maximal effective concentration 

FRET fluorescence resonance energy transfer 

HBTU 2-(1H-benzotriazole-1-yl)-1,1,3,3-tetramethyl- 
uronium hexafluorophosphate 

HCoV human coronavirus 

ICso half maximal inhibitory concentration 

M membrane glycoproteins 

MP» main protease 

N nucleocapsid protein 

nsp nonstructural protein 

ORF open reading frames 

RP-HPLC reverse-phase high-pressure liquid 
chromatography 

S spike 

SARS severe acute respiratory syndrome 

TF2B 3-isotheaflavin-3-gallate 

TFA trifluoroacetic acid 

TGEV transmissible gastroenteritis virus 
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